Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 36
Filtrar
1.
Life Sci Alliance ; 7(5)2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38418088

RESUMO

Detecting structural variants (SVs) in whole-genome sequencing poses significant challenges. We present a protocol for variant calling, merging, genotyping, sensitivity analysis, and laboratory validation for generating a high-quality SV call set in whole-genome sequencing from the Alzheimer's Disease Sequencing Project comprising 578 individuals from 111 families. Employing two complementary pipelines, Scalpel and Parliament, for SV/indel calling, we assessed sensitivity through sample replicates (N = 9) with in silico variant spike-ins. We developed a novel metric, D-score, to evaluate caller specificity for deletions. The accuracy of deletions was evaluated by Sanger sequencing. We generated a high-quality call set of 152,301 deletions of diverse sizes. Sanger sequencing validated 114 of 146 detected deletions (78.1%). Scalpel excelled in accuracy for deletions ≤100 bp, whereas Parliament was optimal for deletions >900 bp. Overall, 83.0% and 72.5% of calls by Scalpel and Parliament were validated, respectively, including all 11 deletions called by both Parliament and Scalpel between 101 and 900 bp. Our flexible protocol successfully generated a high-quality deletion call set and a truth set of Sanger sequencing-validated deletions with precise breakpoints spanning 1-17,000 bp.


Assuntos
Doença de Alzheimer , Humanos , Doença de Alzheimer/genética , Sequenciamento Completo do Genoma/métodos
2.
Alzheimers Dement ; 20(3): 2058-2071, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38215053

RESUMO

INTRODUCTION: Clinical research in Alzheimer's disease (AD) lacks cohort diversity despite being a global health crisis. The Asian Cohort for Alzheimer's Disease (ACAD) was formed to address underrepresentation of Asians in research, and limited understanding of how genetics and non-genetic/lifestyle factors impact this multi-ethnic population. METHODS: The ACAD started fully recruiting in October 2021 with one central coordination site, eight recruitment sites, and two analysis sites. We developed a comprehensive study protocol for outreach and recruitment, an extensive data collection packet, and a centralized data management system, in English, Chinese, Korean, and Vietnamese. RESULTS: ACAD has recruited 606 participants with an additional 900 expressing interest in enrollment since program inception. DISCUSSION: ACAD's traction indicates the feasibility of recruiting Asians for clinical research to enhance understanding of AD risk factors. ACAD will recruit > 5000 participants to identify genetic and non-genetic/lifestyle AD risk factors, establish blood biomarker levels for AD diagnosis, and facilitate clinical trial readiness. HIGHLIGHTS: The Asian Cohort for Alzheimer's Disease (ACAD) promotes awareness of under-investment in clinical research for Asians. We are recruiting Asian Americans and Canadians for novel insights into Alzheimer's disease. We describe culturally appropriate recruitment strategies and data collection protocol. ACAD addresses challenges of recruitment from heterogeneous Asian subcommunities. We aim to implement a successful recruitment program that enrolls across three Asian subcommunities.


Assuntos
Doença de Alzheimer , População norte-americana , Humanos , Doença de Alzheimer/genética , Projetos Piloto , Asiático/genética , Canadá , Fatores de Risco
3.
Nat Commun ; 15(1): 684, 2024 Jan 23.
Artigo em Inglês | MEDLINE | ID: mdl-38263370

RESUMO

The heterogeneity of the whole-exome sequencing (WES) data generation methods present a challenge to a joint analysis. Here we present a bioinformatics strategy for joint-calling 20,504 WES samples collected across nine studies and sequenced using ten capture kits in fourteen sequencing centers in the Alzheimer's Disease Sequencing Project. The joint-genotype called variant-called format (VCF) file contains only positions within the union of capture kits. The VCF was then processed specifically to account for the batch effects arising from the use of different capture kits from different studies. We identified 8.2 million autosomal variants. 96.82% of the variants are high-quality, and are located in 28,579 Ensembl transcripts. 41% of the variants are intronic and 1.8% of the variants are with CADD > 30, indicating they are of high predicted pathogenicity. Here we show our new strategy can generate high-quality data from processing these diversely generated WES samples. The improved ability to combine data sequenced in different batches benefits the whole genomics research community.


Assuntos
Doença de Alzheimer , Humanos , Exoma , Biologia Computacional , Confiabilidade dos Dados , Genótipo
4.
BMC Genomics ; 25(1): 115, 2024 Jan 26.
Artigo em Inglês | MEDLINE | ID: mdl-38279154

RESUMO

BACKGROUND: Short tandem repeats (STRs) are widely distributed across the human genome and are associated with numerous neurological disorders. However, the extent that STRs contribute to disease is likely under-estimated because of the challenges calling these variants in short read next generation sequencing data. Several computational tools have been developed for STR variant calling, but none fully address all of the complexities associated with this variant class. RESULTS: Here we introduce LUSTR which is designed to address some of the challenges associated with STR variant calling by enabling more flexibility in defining STR loci, allowing for customizable modules to tailor analyses, and expanding the capability to call somatic and multiallelic STR variants. LUSTR is a user-friendly and easily customizable tool for targeted or unbiased genome-wide STR variant screening that can use either predefined or novel genome builds. Using both simulated and real data sets, we demonstrated that LUSTR accurately infers germline and somatic STR expansions in individuals with and without diseases. CONCLUSIONS: LUSTR offers a powerful and user-friendly approach that allows for the identification of STR variants and can facilitate more comprehensive studies evaluating the role of pathogenic STR variants across human diseases.


Assuntos
Genoma Humano , Repetições de Microssatélites , Humanos , Repetições de Microssatélites/genética , Células Germinativas , Sequenciamento de Nucleotídeos em Larga Escala
5.
Alzheimers Dement ; 20(2): 1123-1136, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-37881831

RESUMO

INTRODUCTION: The National Institute on Aging Genetics of Alzheimer's Disease Data Storage Site Alzheimer's Genomics Database (GenomicsDB) is a public knowledge base of Alzheimer's disease (AD) genetic datasets and genomic annotations. METHODS: GenomicsDB uses a custom systems architecture to adopt and enforce rigorous standards that facilitate harmonization of AD-relevant genome-wide association study summary statistics datasets with functional annotations, including over 230 million annotated variants from the AD Sequencing Project. RESULTS: GenomicsDB generates interactive reports compiled from the harmonized datasets and annotations. These reports contextualize AD-risk associations in a broader functional genomic setting and summarize them in the context of functionally annotated genes and variants. DISCUSSION: Created to make AD-genetics knowledge more accessible to AD researchers, the GenomicsDB is designed to guide users unfamiliar with genetic data in not only exploring but also interpreting this ever-growing volume of data. Scalable and interoperable with other genomics resources using data technology standards, the GenomicsDB can serve as a central hub for research and data analysis on AD and related dementias. HIGHLIGHTS: The National Institute on Aging Genetics of Alzheimer's Disease Data Storage Site (NIAGADS) offers to the public a unique, disease-centric collection of AD-relevant GWAS summary statistics datasets. Interpreting these data is challenging and requires significant bioinformatics expertise to standardize datasets and harmonize them with functional annotations on genome-wide scales. The NIAGADS Alzheimer's GenomicsDB helps overcome these challenges by providing a user-friendly public knowledge base for AD-relevant genetics that shares harmonized, annotated summary statistics datasets from the NIAGADS repository in an interpretable, easily searchable format.


Assuntos
Doença de Alzheimer , Estados Unidos , Humanos , Doença de Alzheimer/genética , Estudo de Associação Genômica Ampla , National Institute on Aging (U.S.) , Genômica , Bases de Dados Factuais , Predisposição Genética para Doença/genética
6.
medRxiv ; 2023 Nov 16.
Artigo em Inglês | MEDLINE | ID: mdl-38014121

RESUMO

Studies of the genetics of Alzheimer's disease (AD) have largely focused on single nucleotide variants and short insertions/deletions. However, most of the disease heritability has yet to be uncovered, suggesting that there is substantial genetic risk conferred by other forms of genetic variation. There are over one million short tandem repeats (STRs) in the genome, and their link to AD risk has not been assessed. As pathogenic expansions of STR cause over 30 neurologic diseases, it is important to ascertain whether STRs may also be implicated in AD risk. Here, we genotyped 321,742 polymorphic STR tracts genome-wide using PCR-free whole genome sequencing data from 2,981 individuals (1,489 AD case and 1,492 control individuals). We implemented an approach to identify STR expansions as STRs with tract lengths that are outliers from the population. We then tested for differences in aggregate burden of expansions in case versus control individuals. AD patients had a 1.19-fold increase of STR expansions compared to healthy elderly controls (p=8.27×10-3, two-sided Mann Whitney test). Individuals carrying > 30 STR expansions had 3.62-fold higher odds of having AD and had more severe AD neuropathology. AD STR expansions were highly enriched within active promoters in post-mortem hippocampal brain tissues and particularly within SINE-VNTR-Alu (SVA) retrotransposons. Together, these results demonstrate that expanded STRs within active promoter regions of the genome promote risk of AD.

7.
Res Sq ; 2023 Oct 05.
Artigo em Inglês | MEDLINE | ID: mdl-37886469

RESUMO

Structural variations (SVs) are important contributors to the genetics of human diseases. However, their role in Alzheimer's disease (AD) remains largely unstudied due to challenges in accurately detecting SVs. We analyzed whole-genome sequencing data from the Alzheimer's Disease Sequencing Project (N = 16,905) and identified 400,234 (168,223 high-quality) SVs. Laboratory validation yielded a sensitivity of 82% (85% for high-quality). We found a significant burden of deletions and duplications in AD cases, particularly for singletons and homozygous events. On AD genes, we observed the ultra-rare SVs associated with the disease, including protein-altering SVs in ABCA7, APP, PLCG2, and SORL1. Twenty-one SVs are in linkage disequilibrium (LD) with known AD-risk variants, exemplified by a 5k deletion in complete LD with rs143080277 in NCK2. We also identified 16 SVs associated with AD and 13 SVs linked to AD-related pathological/cognitive endophenotypes. This study highlights the pivotal role of SVs in shaping our understanding of AD genetics.

8.
medRxiv ; 2023 Sep 02.
Artigo em Inglês | MEDLINE | ID: mdl-37693521

RESUMO

Alzheimer's Disease (AD) is a common disorder of the elderly that is both highly heritable and genetically heterogeneous. Here, we investigated the association between AD and both common variants and aggregates of rare coding and noncoding variants in 13,371 individuals of diverse ancestry with whole genome sequence (WGS) data. Pooled-population analyses identified genetic variants in or near APOE, BIN1, and LINC00320 significantly associated with AD (p < 5×10-8). Population-specific analyses identified a haplotype on chromosome 14 including PSEN1 associated with AD in Hispanics, further supported by aggregate testing of rare coding and noncoding variants in this region. Finally, we observed suggestive associations (p < 5×10-5) of aggregates of rare coding rare variants in ABCA7 among non-Hispanic Whites (p=5.4×10-6), and rare noncoding variants in the promoter of TOMM40 distinct of APOE in pooled-population analyses (p=7.2×10-8). Complementary pooled-population and population-specific analyses offered unique insights into the genetic architecture of AD.

9.
medRxiv ; 2023 Aug 29.
Artigo em Inglês | MEDLINE | ID: mdl-37693582

RESUMO

INTRODUCTION: Despite a two-fold increased risk, individuals of African ancestry have been significantly underrepresented in Alzheimer's Disease (AD) genomics efforts. METHODS: GWAS of 2,903 AD cases and 6,265 cognitive controls of African ancestry. Within-dataset results were meta-analyzed, followed by gene-based and pathway analyses, and analysis of RNAseq and whole-genome sequencing data. RESULTS: A novel AD risk locus was identified in MPDZ on chromosome 9p23 (rs141610415, MAF=.002, P =3.68×10 -9 ). Two additional novel common and nine novel rare loci approached genome-wide significance at P <9×10 -7 . Comparison of association and LD patterns between datasets with higher and lower degrees of African ancestry showed differential association patterns at chr12q23.2 ( ASCL1 ), suggesting that the association is modulated by regional origin of local African ancestry. DISCUSSION: Increased sample sizes and sample sets from Africa covering as much African genetic diversity as possible will be critical to identify additional disease-associated loci and improve deconvolution of local genetic ancestry effects.

10.
medRxiv ; 2023 Sep 13.
Artigo em Inglês | MEDLINE | ID: mdl-37745545

RESUMO

Structural variations (SVs) are important contributors to the genetics of numerous human diseases. However, their role in Alzheimer's disease (AD) remains largely unstudied due to challenges in accurately detecting SVs. Here, we analyzed whole-genome sequencing data from the Alzheimer's Disease Sequencing Project (ADSP, N=16,905 subjects) and identified 400,234 (168,223 high-quality) SVs. We found a significant burden of deletions and duplications in AD cases (OR=1.05, P=0.03), particularly for singletons (OR=1.12, P=0.0002) and homozygous events (OR=1.10, P<0.0004). On AD genes, the ultra-rare SVs, including protein-altering SVs in ABCA7, APP, PLCG2, and SORL1, were associated with AD (SKAT-O P=0.004). Twenty-one SVs are in linkage disequilibrium (LD) with known AD-risk variants, e.g., a deletion (chr2:105731359-105736864) in complete LD (R2=0.99) with rs143080277 (chr2:105749599) in NCK2. We also identified 16 SVs associated with AD and 13 SVs associated with AD-related pathological/cognitive endophenotypes. Our findings demonstrate the broad impact of SVs on AD genetics.

11.
Front Aging Neurosci ; 15: 1168638, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37577355

RESUMO

To better capture the polygenic architecture of Alzheimer's disease (AD), we developed a joint genetic score, MetaGRS. We incorporated genetic variants for AD and 24 other traits from two independent cohorts, NACC (n = 3,174, training set) and UPitt (n = 2,053, validation set). One standard deviation increase in the MetaGRS is associated with about 57% increase in the AD risk [hazard ratio (HR) = 1.577, p = 7.17 E-56], showing little difference from the HR for AD GRS alone (HR = 1.579, p = 1.20E-56), suggesting similar utility of both models. We also conducted APOE-stratified analyses to assess the role of the e4 allele on risk prediction. Similar to that of the combined model, our stratified results did not show a considerable improvement of the MetaGRS. Our study showed that the prediction power of the MetaGRS significantly outperformed that of the reference model without any genetic information, but was effectively equivalent to the prediction power of the AD GRS.

12.
medRxiv ; 2023 Jul 08.
Artigo em Inglês | MEDLINE | ID: mdl-37461624

RESUMO

Limited ancestral diversity has impaired our ability to detect risk variants more prevalent in non-European ancestry groups in genome-wide association studies (GWAS). We constructed and analyzed a multi-ancestry GWAS dataset in the Alzheimer's Disease (AD) Genetics Consortium (ADGC) to test for novel shared and ancestry-specific AD susceptibility loci and evaluate underlying genetic architecture in 37,382 non-Hispanic White (NHW), 6,728 African American, 8,899 Hispanic (HIS), and 3,232 East Asian individuals, performing within-ancestry fixed-effects meta-analysis followed by a cross-ancestry random-effects meta-analysis. We identified 13 loci with cross-ancestry associations including known loci at/near CR1 , BIN1 , TREM2 , CD2AP , PTK2B , CLU , SHARPIN , MS4A6A , PICALM , ABCA7 , APOE and two novel loci not previously reported at 11p12 ( LRRC4C ) and 12q24.13 ( LHX5-AS1 ). Reflecting the power of diverse ancestry in GWAS, we observed the SHARPIN locus using 7.1% the sample size of the original discovering single-ancestry GWAS (n=788,989). We additionally identified three GWS ancestry-specific loci at/near ( PTPRK ( P =2.4×10 -8 ) and GRB14 ( P =1.7×10 -8 ) in HIS), and KIAA0825 ( P =2.9×10 -8 in NHW). Pathway analysis implicated multiple amyloid regulation pathways (strongest with P adjusted =1.6×10 -4 ) and the classical complement pathway ( P adjusted =1.3×10 -3 ). Genes at/near our novel loci have known roles in neuronal development ( LRRC4C, LHX5-AS1 , and PTPRK ) and insulin receptor activity regulation ( GRB14 ). These findings provide compelling support for using traditionally-underrepresented populations for gene discovery, even with smaller sample sizes.

13.
Hum Mol Genet ; 31(R1): R62-R72, 2022 10 20.
Artigo em Inglês | MEDLINE | ID: mdl-35943817

RESUMO

Non-coding genetic variants outside of protein-coding genome regions play an important role in genetic and epigenetic regulation. It has become increasingly important to understand their roles, as non-coding variants often make up the majority of top findings of genome-wide association studies (GWAS). In addition, the growing popularity of disease-specific whole-genome sequencing (WGS) efforts expands the library of and offers unique opportunities for investigating both common and rare non-coding variants, which are typically not detected in more limited GWAS approaches. However, the sheer size and breadth of WGS data introduce additional challenges to predicting functional impacts in terms of data analysis and interpretation. This review focuses on the recent approaches developed for efficient, at-scale annotation and prioritization of non-coding variants uncovered in WGS analyses. In particular, we review the latest scalable annotation tools, databases and functional genomic resources for interpreting the variant findings from WGS based on both experimental data and in silico predictive annotations. We also review machine learning-based predictive models for variant scoring and prioritization. We conclude with a discussion of future research directions which will enhance the data and tools necessary for the effective functional analyses of variants identified by WGS to improve our understanding of disease etiology.


Assuntos
Epigênese Genética , Estudo de Associação Genômica Ampla , Sequenciamento Completo do Genoma , Genômica
14.
J Alzheimers Dis ; 89(1): 1-12, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35848019

RESUMO

The success of genome-wide association studies (GWAS) completed in the last 15 years has reinforced a key fact: polygenic architecture makes a substantial contribution to variation of susceptibility to complex disease, including Alzheimer's disease. One straight-forward way to capture this architecture and predict which individuals in a population are most at risk is to calculate a polygenic risk score (PRS). This score aggregates the risk conferred across multiple genetic variants, ultimately representing an individual's predicted genetic susceptibility for a disease. PRS have received increasing attention after having been successfully used in complex traits. This has brought with it renewed attention on new methods which improve the accuracy of risk prediction. While these applications are initially informative, their utility is far from equitable: the majority of PRS models use samples heavily if not entirely of individuals of European descent. This basic approach opens concerns of health equity if applied inaccurately to other population groups, or health disparity if we fail to use them at all. In this review we will examine the methods of calculating PRS and some of their previous uses in disease prediction. We also advocate for, with supporting scientific evidence, inclusion of data from diverse populations in these existing and future studies of population risk via PRS.


Assuntos
Doença de Alzheimer , Estudo de Associação Genômica Ampla , Doença de Alzheimer/genética , Predisposição Genética para Doença/genética , Humanos , Herança Multifatorial/genética , Fatores de Risco
15.
J Oncol Pharm Pract ; : 10781552221104773, 2022 Jun 13.
Artigo em Inglês | MEDLINE | ID: mdl-35698761

RESUMO

INTRODUCTION: Biosimilars confer significant cost-saving advantages and expand patients' access to biologic therapies in cancer care. In line with the increasing availability of antineoplastic biosimilars, it is pertinent to understand the oncologists' view on the adoption of biosimilars in their clinical practice. The study aimed to assess (i) the prevalence of biosimilar use, (ii) perception towards biosimilars, (iii) factors influencing the use of biosimilars and (iv) knowledge about biosimilars among Malaysian oncologists. METHODS: A cross-sectional survey was conducted among clinical oncologists and medical oncologists in Malaysia between January 2020 and February 2021 using a structured 31-item questionnaire. RESULTS: Among the 121 oncologists registered in the country, 36 responded (response rate = 30%). A total of 64% of the respondents prescribed biosimilars either often or always. Most oncologists (72%) agreed or strongly agreed that switching will not have a significant effect on the treatment benefit, with lower percentages saying that they agreed or strongly agreed that it will not lead to the emergence of additional adverse effects (56%) or harmful immunogenicity (64%). Patients' preferences (40%) and the non-availability of biosimilars in hospitals (34%) are the major barriers cited to the prescribing of biosimilars. Cost differences and robust pharmacovigilance activities are the two most important factors that would influence the prescribing of biosimilars. The mean score of knowledge in biosimilar among respondents was 3.81 (± 0.86) out of a maximum possible score of 6. CONCLUSIONS: The identified gap in prescribing and the use of biosimilars among Malaysian oncologists warrant educational intervention and robust pharmacovigilance activities to facilitate the prescribing of biosimilars and ultimately increase the accessibility to biologics in cancer treatment.

16.
Genomics Proteomics Bioinformatics ; 20(6): 1197-1206, 2022 12.
Artigo em Inglês | MEDLINE | ID: mdl-35085778

RESUMO

We aimed to develop a whole-genome sequencing (WGS)-based copy number variant (CNV) calling algorithm with the potential of replacing chromosomal microarray assay (CMA) for clinical diagnosis. JAX-CNV is thus developed for CNV detection from WGS data. The performance of this CNV calling algorithm was evaluated in a blinded manner on 31 samples and compared to the 112 CNVs reported by clinically validated CMAs for these 31 samples. The result showed that JAX-CNV recalled 100% of these CNVs. Besides, JAX-CNV identified an average of 30 CNVs per individual, respresenting an approximately seven-fold increase compared to calls of clinically validated CMAs. Experimental validation of 24 randomly selected CNVs showed one false positive, i.e., a false discovery rate (FDR) of 4.17%. A robustness test on lower-coverage data revealed a 100% sensitivity for CNVs larger than 300 kb (the current threshold for College of American Pathologists) down to 10× coverage. For CNVs larger than 50 kb, sensitivities were 100% for coverages deeper than 20×, 97% for 15×, and 95% for 10×. We developed a WGS-based CNV pipeline, including this newly developed CNV caller JAX-CNV, and found it capable of detecting CMA-reported CNVs at a sensitivity of 100% with about a FDR of 4%. We propose that JAX-CNV could be further examined in a multi-institutional study to justify the transition of first-tier genetic testing from CMAs to WGS. JAX-CNV is available at https://github.com/TheJacksonLaboratory/JAX-CNV.


Assuntos
Algoritmos , Variações do Número de Cópias de DNA , Humanos , Sequenciamento Completo do Genoma
17.
Front Aging Neurosci ; 14: 1073905, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-36846102

RESUMO

Dozens of single nucleotide polymorphisms (SNPs) related to Alzheimer's disease (AD) have been discovered by large scale genome-wide association studies (GWASs). However, only a small portion of the genetic component of AD can be explained by SNPs observed from GWAS. Structural variation (SV) can be a major contributor to the missing heritability of AD; while SV in AD remains largely unexplored as the accurate detection of SVs from the widely used array-based and short-read technology are still far from perfect. Here, we briefly summarized the strengths and weaknesses of available SV detection methods. We reviewed the current landscape of SV analysis in AD and SVs that have been found associated with AD. Particularly, the importance of currently less explored SVs, including insertions, inversions, short tandem repeats, and transposable elements in neurodegenerative diseases were highlighted.

18.
Cancer Res ; 82(4): 543-555, 2022 02 15.
Artigo em Inglês | MEDLINE | ID: mdl-34903603

RESUMO

Alternatively spliced RNA isoforms are a hallmark of tumors, but their nature, prevalence, and clinical implications in gastric cancer have not been comprehensively characterized. We systematically profiled the splicing landscape of 83 gastric tumors and matched normal mucosa, identifying and experimentally validating eight splicing events that can classify all gastric cancers into three subtypes: epithelial-splicing (EpiS), mesenchymal-splicing (MesS), and hybrid-splicing. These subtypes were associated with distinct molecular signatures and epithelial-mesenchymal transition markers. Subtype-specific splicing events were enriched in motifs for splicing factors RBM24 and ESRP1, which were upregulated in MesS and EpiS tumors, respectively. A simple classifier based only on RNA levels of RBM24 and ESRP1, which can be readily implemented in the clinic, was sufficient to distinguish gastric cancer subtypes and predict patient survival in multiple independent patient cohorts. Overall, this study provides insights into alternative splicing in gastric cancer and the potential clinical utility of splicing-based patient classification. SIGNIFICANCE: This study presents a comprehensive analysis of alternative splicing in the context of patient classification, molecular mechanisms, and prognosis in gastric cancer.


Assuntos
Processamento Alternativo , Transição Epitelial-Mesenquimal/genética , Perfilação da Expressão Gênica/métodos , Regulação Neoplásica da Expressão Gênica , Neoplasias Gástricas/genética , Adulto , Idoso , Idoso de 80 Anos ou mais , Linhagem Celular Tumoral , Análise por Conglomerados , Feminino , Humanos , Estimativa de Kaplan-Meier , Masculino , Pessoa de Meia-Idade , Fatores de Processamento de RNA/genética , Proteínas de Ligação a RNA/genética , RNA-Seq/métodos , Reação em Cadeia da Polimerase Via Transcriptase Reversa , Neoplasias Gástricas/classificação
19.
Front Genet ; 12: 710055, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34795690

RESUMO

The explosion of biobank data offers unprecedented opportunities for gene-environment interaction (GxE) studies of complex diseases because of the large sample sizes and the rich collection in genetic and non-genetic information. However, the extremely large sample size also introduces new computational challenges in G×E assessment, especially for set-based G×E variance component (VC) tests, which are a widely used strategy to boost overall G×E signals and to evaluate the joint G×E effect of multiple variants from a biologically meaningful unit (e.g., gene). In this work, we focus on continuous traits and present SEAGLE, a Scalable Exact AlGorithm for Large-scale set-based G×E tests, to permit G×E VC tests for biobank-scale data. SEAGLE employs modern matrix computations to calculate the test statistic and p-value of the GxE VC test in a computationally efficient fashion, without imposing additional assumptions or relying on approximations. SEAGLE can easily accommodate sample sizes in the order of 105, is implementable on standard laptops, and does not require specialized computing equipment. We demonstrate the performance of SEAGLE using extensive simulations. We illustrate its utility by conducting genome-wide gene-based G×E analysis on the Taiwan Biobank data to explore the interaction of gene and physical activity status on body mass index.

20.
Front Genet ; 12: 752390, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34804120

RESUMO

Alzheimer's Disease (AD) is a progressive neurologic disease and the most common form of dementia. While the causes of AD are not completely understood, genetics plays a key role in the etiology of AD, and thus finding genetic factors holds the potential to uncover novel AD mechanisms. For this study, we focus on copy number variation (CNV) detection and burden analysis. Leveraging whole-genome sequence (WGS) data released by Alzheimer's Disease Sequencing Project (ADSP), we developed a scalable bioinformatics pipeline to identify CNVs. This pipeline was applied to 1,737 AD cases and 2,063 cognitively normal controls. As a result, we observed 237,306 and 42,767 deletions and duplications, respectively, with an average of 2,255 deletions and 1,820 duplications per subject. The burden tests show that Non-Hispanic-White cases on average have 16 more duplications than controls do (p-value 2e-6), and Hispanic cases have larger deletions than controls do (p-value 6.8e-5).

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...